Rui Zheng

ABC-Bench: Benchmarking Agentic Backend Coding in Real-World Development

Jan 16, 2026

SimLLM: Fine-Tuning Code LLMs for SimPy-Based Queueing System Simulation

Jan 10, 2026

AgentGym-RL: Training LLM Agents for Long-Horizon Decision Making through Multi-Turn Reinforcement Learning

Sep 10, 2025

Speech-Language Models with Decoupled Tokenizers and Multi-Token Prediction

Jun 14, 2025

Improving RL Exploration for LLM Reasoning through Retrospective Replay

Apr 19, 2025

ImplicitCell: Resolution Cell Modeling of Joint Implicit Volume Reconstruction and Pose Refinement in Freehand 3D Ultrasound

Mar 09, 2025

Better Process Supervision with Bi-directional Rewarding Signals

Mar 06, 2025

Multi-Class Segmentation of Aortic Branches and Zones in Computed Tomography Angiography: The AortaSeg24 Challenge

Feb 07, 2025

CDI: Blind Image Restoration Fidelity Evaluation based on Consistency with Degraded Image

Jan 24, 2025

Enhancing LLM Reasoning via Critique Models with Test-Time and Training-Time Supervision

Nov 25, 2024